NSF PAR Search | NSF Public Access Repository

Privacy Profiles for Private Selection

Koskela, Antti; Redberg, Rachel; Wang, Yu-Xiang (July 2024, Proceedings of Machine Learning Research)

Private selection mechanisms (e.g., Report Noisy Max, Sparse Vector) are fundamental primitives of differentially private (DP) data analysis with wide applications to private query release, voting, and hyperparameter tuning. Recent work (Liu and Talwar, 2019; Papernot and Steinke, 2022) has made significant progress in both generalizing private selection mechanisms and tightening their privacy analysis using modern numerical privacy accounting tools, e.g., Rényi DP (RDP). But Rényi DP is known to be lossy when (ϵ,δ)-DP is ultimately needed, and there is a trend to close the gap by directly handling privacy profiles, i.e., δ as a function of ϵ, or the equivalent dual form known as f-DP. In this paper, we work out an easy-to-use recipe that bounds the privacy profiles of ReportNoisyMax and PrivateTuning using the privacy profiles of the base algorithms they corral. Numerically, our approach improves over the RDP-based accounting in all regimes of interest and leads to substantial benefits in end-to-end private learning experiments. Our analysis also suggests new distributions for randomizing the number of rounds, e.g., the binomial distribution, which lead to more substantial improvements in certain regimes.
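To make the notion of a privacy profile concrete, here is a minimal sketch that evaluates δ as a function of ϵ for a single Gaussian mechanism, using the well-known closed form from the analytic Gaussian mechanism literature. It is not the recipe from the paper above; the function name and parameters (gaussian_privacy_profile, sigma, sensitivity) are illustrative assumptions.

```python
import math
from statistics import NormalDist


def gaussian_privacy_profile(eps: float, sigma: float, sensitivity: float = 1.0) -> float:
    """Return delta(eps) for a Gaussian mechanism with noise std `sigma`.

    Uses the standard closed form
        delta(eps) = Phi(-eps/mu + mu/2) - exp(eps) * Phi(-eps/mu - mu/2),
    where mu = sensitivity / sigma and Phi is the standard normal CDF.
    This is an illustration for one base mechanism, not the paper's
    bound for ReportNoisyMax or PrivateTuning.
    """
    mu = sensitivity / sigma
    phi = NormalDist().cdf
    return phi(-eps / mu + mu / 2) - math.exp(eps) * phi(-eps / mu - mu / 2)


if __name__ == "__main__":
    # Tabulate the profile at a few eps values for sigma = 1.
    for eps in (0.0, 0.5, 1.0, 2.0):
        print(eps, gaussian_privacy_profile(eps, sigma=1.0))
```

The profile is non-increasing in ϵ, so a single curve summarizes every (ϵ,δ) pair the mechanism satisfies, which is the object the abstract proposes to track directly instead of converting from Rényi DP.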
Full Text Available
Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Lin, Yingyu; Ma, Yi-An; Wang, Yu-Xiang; Redberg, Rachel; Bu, Zhiqi (May 2024, ICLR 2024)

Posterior sampling, i.e., using the exponential mechanism to sample from the posterior distribution, provides ε-pure differential privacy (DP) guarantees and does not suffer from the potentially unbounded privacy breach introduced by (ε,δ)-approximate DP. In practice, however, one needs to apply approximate sampling methods such as Markov chain Monte Carlo (MCMC), thus re-introducing the unappealing δ-approximation error into the privacy guarantees. To bridge this gap, we propose the Approximate SAmple Perturbation (ASAP) algorithm, which perturbs an MCMC sample with noise proportional to its Wasserstein-infinity (W∞) distance from a reference distribution that satisfies pure DP or pure Gaussian DP (i.e., δ=0). We then leverage a Metropolis-Hastings algorithm to generate the sample and prove that the algorithm converges in W∞ distance. We show that by combining our new techniques with a localization step, we obtain the first nearly linear-time algorithm that achieves the optimal rates in the DP-ERM problem with strongly convex and smooth losses.
Full Text Available
Improving the Privacy and Practicality of Objective Perturbation for Differentially Private Linear Learners

Redberg, Rachel; Koskela, Antti; Wang, Yu-Xiang (December 2023, Advances in neural information processing systems)

In the arena of privacy-preserving machine learning, differentially private stochastic gradient descent (DP-SGD) has outstripped the objective perturbation mechanism in popularity and interest. Though unrivaled in versatility, DP-SGD requires a non-trivial privacy overhead (for privately tuning the model’s hyperparameters) and a computational complexity which might be extravagant for simple models such as linear and logistic regression. This paper revamps the objective perturbation mechanism with tighter privacy analyses and new computational tools that boost it to perform competitively with DP-SGD on unconstrained convex generalized linear problems.
Full Text Available
Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy

Lin, Yingyu; Ma, Yian; Wang, Yu-Xiang; Redberg, Rachel E; Bu, Zhiqi (January 2024, The Twelfth International Conference on Learning Representations)

Full Text Available
Generalized PTR: User-Friendly Recipes for Data-Adaptive Algorithms with Differential Privacy

Redberg, Rachel; Zhu, Yuqing; Wang, Yu-Xiang (April 2023, Proceedings of Machine Learning Research)

The "Propose-Test-Release" (PTR) framework is a classic recipe for designing differentially private (DP) algorithms that are data-adaptive, i.e., those that add less noise when the input dataset is nice. We extend PTR to a more general setting by privately testing data-dependent privacy losses rather than local sensitivity, hence making it applicable beyond the standard noise-adding mechanisms, e.g., to queries with unbounded or undefined sensitivity. We demonstrate the versatility of generalized PTR using private linear regression as a case study. Additionally, we apply our algorithm to solve an open problem from "Private Aggregation of Teacher Ensembles (PATE)": privately releasing the entire model with a delicate data-dependent analysis.
Full Text Available
Differentially Private Linear Sketches: Efficient Implementations and Applications

Zhao, Fuheng; Qiao, Dan; Redberg, Rachel; Agrawal, Divyakant; Abbadi, Amr El; Wang, Yu-Xiang (December 2022, Advances in neural information processing systems)

Linear sketches have been widely adopted to process fast data streams, and they can be used to accurately estimate frequencies, approximate the top-K items, and summarize data distributions. When data are sensitive, it is desirable to provide privacy guarantees for linear sketches to preserve private information while delivering useful results with theoretical bounds. We show that linear sketches can ensure privacy and maintain their unique properties with a small amount of noise added at initialization. From the differentially private linear sketches, we showcase that the state-of-the-art quantile sketch in the turnstile model can also be private and maintain high performance. Experiments further demonstrate that our proposed differentially private sketches are quantitatively and qualitatively similar to noise-free sketches with high utilization on synthetic and real datasets.
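The idea of adding noise once, at initialization, can be illustrated with a small Count-Min style sketch whose counters start at random noise instead of zero. This is only a sketch under stated assumptions: the class name NoisyCountMin, the use of Laplace noise, the scale depth/epsilon (chosen because one item touches one counter per row), and the min-based query are illustrative choices, not the construction or calibration from the paper.

```python
import hashlib
import random


class NoisyCountMin:
    """Count-Min style linear sketch with noise added at initialization.

    Assumption (not from the paper): each counter starts at Laplace noise
    with scale depth / epsilon, since adding or removing one item changes
    exactly one counter in each of the `depth` rows by 1.
    """

    def __init__(self, width: int, depth: int, epsilon: float, seed=None):
        self.width, self.depth = width, depth
        rng = random.Random(seed)
        rate = epsilon / depth  # Laplace scale is depth / epsilon

        # Difference of two i.i.d. exponentials is Laplace distributed.
        def laplace() -> float:
            return rng.expovariate(rate) - rng.expovariate(rate)

        self.table = [[laplace() for _ in range(width)] for _ in range(depth)]

    def _bucket(self, row: int, item: str) -> int:
        # One salted hash per row, so rows behave like independent hashes.
        digest = hashlib.blake2b(
            item.encode("utf-8"), digest_size=8, salt=row.to_bytes(8, "little")
        ).digest()
        return int.from_bytes(digest, "little") % self.width

    def update(self, item: str, count: float = 1.0) -> None:
        # Linear update: negative counts model deletions (turnstile model).
        for row in range(self.depth):
            self.table[row][self._bucket(row, item)] += count

    def query(self, item: str) -> float:
        # Classic Count-Min estimate: minimum over the item's counters.
        return min(self.table[row][self._bucket(row, item)]
                   for row in range(self.depth))


if __name__ == "__main__":
    sketch = NoisyCountMin(width=256, depth=4, epsilon=1.0, seed=0)
    for _ in range(100):
        sketch.update("apple")
    print(sketch.query("apple"))
```

Because the noise lives in the initial state and every update is linear, the sketch can be updated and merged exactly as a noise-free one would be, which is the property the abstract highlights.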
Full Text Available
Privately Publishable Per-instance Privacy

Redberg, Rachel; Wang, Yu-Xiang (December 2021, Advances in neural information processing systems)

We consider how to privately share the personalized privacy losses incurred by objective perturbation, using per-instance differential privacy (pDP). Standard differential privacy (DP) gives us a worst-case bound that might be orders of magnitude larger than the privacy loss to a particular individual relative to a fixed dataset. The pDP framework provides a more fine-grained analysis of the privacy guarantee to a target individual, but the per-instance privacy loss itself might be a function of sensitive data. In this paper, we analyze the per-instance privacy loss of releasing a private empirical risk minimizer learned via objective perturbation, and propose a group of methods to privately and accurately publish the pDP losses at little to no additional privacy cost.
Full Text Available
Tree++: Truncated Tree Based Graph Kernels

https://doi.org/10.1109/TKDE.2019.2946149

Ye, Wei; Wang, Zhen; Redberg, Rachel; Singh, Ambuj (April 2021, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available